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Abstract. Presentation of set matrices and demonstration of their efficiency 
as a tool using the path/cycle problem. 



Introduction 

Set matrices are matrices whose elements are sets. The matrices comprise abil- 
ities of data storing and processing. That makes them a promising combinatorial 
structure. To prove the concept, this work applies the matrices to the path/cycle 
problem, see[Il[a[a|4l[5l[6l[7l[g[l0l[IIl[l2l[T3l[14l[ISland many others]. The 
problem may be generalized as a problem to find all paths and all cycles of all length 
in form of vertex pairs (start, finish). That is a NP-hard problem because any of its 
solutions will include a solution of the Hamiltonian path/cycle problems [5]. This 
presentation uses set matrices to realize the following plan to solve the generalized 
problem: present the walk length dynamics with a generative grammar, but include 
in the grammar's production rules some path/cycle filters in order to deplete the 
resulting walk language to the indication of path/cycle's presence/absence, only. 

The design's idea may be traced back trough the dynamic programming, the 
Ramsey theory, the formal language theory, and to the icosian calculus [HI [17] . 
Realization of the design requires to maintain a set of visited/unvisited vertices 
and to use that set as a filter in production of the next generation of walks. Set 
matrices satisfy the requirements. Sorting/factoring of the visited/unvisited ver- 
tices into vertex pairs (start, finish) creates a set matrix analog of the adjacency 
matrix. And the especially designed powers of the set matrix create an analytic 
path/cycle filter. The path/cycle language's specification gets a realization in form 
of the easy-to-check properties of the elements of the adjacency set matrix's pow- 
ers. 

The factoring of the set of visited/unvisited vertices into vertex pairs (start, 
finish) may be seen as a walk coloring where colors are the factor-sets. Then, the 
family of algorithms realizing the design can be parametrized with the following four 
extreme strategies: to color the walks with sets of the visited/unvisited start/finish 
vertices. Work [T^ describes a walk coloring with the unvisited vertices. This work 
deploys walk coloring with the visited vertices. 

Worst case for the algorithms is a complete graph. For a complete graph with 
n vertices, the algorithms perform n iterations and, on each of these iterations, 
0(n^)-time processing for each of the -n? vertex pairs. That totals in time O(n^) 
needed for the algorithms to find all paths and all cycles of all length in the form 
of vertex pairs (start, finish). 
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1. Set Matrices 

Let V he a, universal set. Set matrices are matrices whose elements are sets. All 
set operations can be defined on the set matrices. For example, if ^ = (o-ij) s-nd 
B = (bij) are set matrices of the appropriate sizes, then 
Compliment: 



Join: 

Intersection: 
Multiplication: 



AUB = {a,jUbij); 
AnB = (a.y nbi-j), 



(1.1) {AB\,, = 



- where "x" is Cartesian product of sets, etc. More operations can be found in [T5] . 

For the path/cycle problem, the most interesting operation is the set matrix 
multiplication. The operation can be redefined in different ways. In this presen- 
tation, let us use the following multiplication: for set matrices A = {aij)nxm and 
B = (foy )mxfei product AB is the n x k set matrix whose elements are 

nl=i a^f,\Jb^j, i ^ i 

Here and further, symbol {X)ij means (i, j)-element of matrix X. 

Formula 11.11 is the formula of the number matrix multiplication, except "+" is 
replaced with "n", "x" is replaced with "U", and some special cases are taken 
care of. The special cases treatment makes multiplication 11.11 a non-associative 
operation: 

Exercise 1.1. 

{«} A {b} J\{c} J~{9 V J{{c} J-{(1> 

0\y0 M/^Mi-M ^ \ f ^ \ - f ^ 
w j ^ m 1 w A y v {«} 

Let A he a square set matrix. The following iterations define the left and right 
k-th powers of the matrix, fc > 1: 

=T^ = A 



(1.2) >''^\ V, 



Let us estimate the computational complexity of formula 11.21 Multiplication 11.11 
requires 0{n^) operations "U" and "fl". Thus, if tk~i is the number of operations 
needed to calculate {k — l)-th power, then the number of operations needed to 
calculate A;-th power is 
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Thus, the time needed to calculate fc-th power can be estimated as 
(1.3) 0{kn^\V\). 

The list of set matrix operations and properties can be continued. But let us start 
and demonstrate some benefits. 

2. Path problem 

Let g = (y, A) be a given (multi) digraph: V is the vertex set and A is the arc 
set of g. Let the vertex set V be the universal set. Let's enumerate it: 

V = {vi, V2, . . . , Vn}- 

Let G be the adjacency matrix of g appropriate to this enumeration. Then the 
positive elements of powers of G indicate the presence of walks; vertex pairs (start, 
finish) of fc-walks are indexes of positive elements of matrix G'^ . The powers of this 
adjacency matrix can detect a shortest path but not a path of a specific length. 
Also, calculating the powers involves magnitudes of 

0{n''~^{m^ix{G)^J)^). 

Although, the last problem can be solved with the Boolean adjacency matrices [18j . 
Let T be the following set matrix of size n x n: 

{vj}, (G)„ >0 A i^j 
V, (G)„- <0 V i^j 

Matrix T may be seen as an adjacency set matrix. Let T'' be the fc-th right power 
of matrix T, defined with formulas 11.21 



(2.1) in 



Lemma 2.1. In digraph g for k < n, if set {T )ij ^ V, then the set is equal to 

where the intersection is taken over all ordered number samples 

IJ. = (ah,M2, • • ■ ,Mfc-i,Mfc) 
which satisfy the following constrains: 

1 < Ma: ^ "-7 a; = 1, 2, . . . , fc 

{vi,Vf,^) e A, (wp^,u^^+J e A, x = 1, 2, . . . , fc - 1 

^J,x 7^i, x = 1,2, . . . , fc 

Mfe = j 

- where set A is the arc set of digraph g. 
Proof. Due to definitions 11.21 and 12.11 if 

{T'h - n u (Ti)^,^, u . . . u (ri)^,_,^,_, U (Ti)^,_,^, ^ V, 

then there are number samples /i = {fii, ■ ■ ■ ,Mfc-i,Mfc) which satisfy the first 
four constrains, and 

(2.2) iT%, = fl {v^, } U } U . . . U J U {v^,}, 
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where the intersection is taken over all those number samples. Proving the last 
constrain will prove the lemma. To do so, let's use mathematical induction over k. 

For fc = 1, due to definitions 11.21 and 12.11 {T^)ij V iS there are arcs from 
vertex Vi into vertex Vj and the arcs are not loops (i ^ j). Then, (T^)ij = {vj} 
and {vi,Vj) G A. Thus, the lemma holds for k = I. 

Because of an irregularity in the powers definition, the induction has to start 
from k = 2. In this case, due to definitions 11.21 and 12.11 if 

{T%,^f]{T%U{T'),,^V, 

7 

then there are such indexes 7 that 

i#7. 7#J, {vi,v^)eA, (v^,Vj)(£A 

where A is the arc set of digraph g. Thus, the lemma holds for k ~ 2. 

Let's assume that the lemma holds for allA;<TO— l<n— 1, and let {T"'-)ij ^ V. 
Then, due to decomposition [221 

(r™)., = n u {v^j u . . . u J u {v^^} + V, 

where the intersection is taken over some number samples /x, satisfying the first 
four constrains. Then, there is such number sample /i that 

{^'Mil u KJ u . . . u {vp^^J u = z^v. 

Then, due to decomposition 12.21 for any of such number samples /i, the following 
holds: 

(T"~^)zM„_, C {v^,}\J {v^,} U . . . U J QZ^V, 

and 

{T'^-^)^n^^ C U {^^3} U . . . U {v^^^ ^Z^V. 

Then, due to the induction hypothesis, both number samples 

and 

satisfy all five constrains. Particularly, 

MxT^My, ^ x^y, = l,2,...,m-l; 
Mx ^ My; ^ xi^y, a;, y = 2, 3, . . . , m; 
and, due to the third constrain for (7"™^^);^ip,„ 7^ V , 

Ml 7^ Mm = j- 

Thus, the whole number sample m satisfies the fifth constrain. That concludes the 
induction and proves the lemma for all k < n. □ 

Lemma |2. II allows the following interpretation: 

Lemma 2.2. In digraph g, if {T^)ij ^ V , then there is a k-path from vertex Vi 
into vertex Vj . 

Proof. The constrains in lemma I^TT] are the definition of a path from Vi into Vj. □ 
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Lemmas 12.11 and 1 2.21 show that matrices T*^ conect the vertex-bridges. That may 
be interesting for the graph toughness theory [6l [15] . 

Lemma 2.3. In digraph g, if there is a k-path from vertex Vi into vertex Vj then 
Proof. Let the following vertices constitute a fc-path from vertex Vi into vertex Vj : 

Indexes of these vertices satisfy the constrains in lemma 12.11 Then, due to defini- 
tions O and O 

{T%j = n u (Ti)^,^, u . . . u (ri)^,_,^,_, U (Ti)^,_,^, c 

c u {v^,}u . . . u {v^,}u J c y - ^ V. 

□ 

Theorem 2.4. In digraph g for k > 1, there are k-paths from vertex Vi into vertex 
'"j iff 

{T% + V. 

Proof. The theorem aggregates lemmas 12.21 and 12.31 Let us notice that case k > n 
is covered by lemmas 12.11 and 12.31 

k>n ^ r'= = {V)nxn- 

□ 



(2.3) (i?). 



Estimation 11.31 shows the computational complexity to detect the fc-paths with 
theorem 12.41 Particularly, when k — n — 1, the theorem detects the existence or 
absence of Hamiltonian paths in time 

0(n5). 

All the results can be repeated with the left powers of matrix T. Also, definition 
12.11 uses the arc finish vertices. Obviously, the results can be repeated with the 
start vertices using the following set matrix instead of matrix [27T] 

{v,}, (G)y >0 A i^j 

V, (G)y < V i = J 

Colorings 12.11 and 12.31 cover two of the four extreme strategies of walk coloring: to 
color walks with the visited start /finish vertices. Another two extreme strategies 
are discussed in |18] . They produce the same results but in terms of the compliment 
sets. 

3. Cycle problem 

Obviously, the solution of the path problem described in section [5] solves the 
cycle problem, as well. Let us formalize that analytically. 

Let's define another set matrix multiplication: if A and B are set matrices of 
appropriate sizes, then 



(3.1) {AB), 



_ / a (^),:.U(B),,, i^j 
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And let us define the following walk coloring: 

, . _ ( {"Loop"}, (G)y >0 A i^j 
^ ^'■'■"l V, (G)., <OVz^j' 

(3.2) 5"^ = S", 8''+^ ^ T'^R^, k>l, 

- where set matrices T*^ and were defined in section[21 and matrix multiplication 
13. H is used. 

Theorem 3.1. In digraph g for k > 1, there are k-cycles attached to vertex Vi iff 

{S'^h + V. 

Proof. Case when fc = 1 is obvious. Let A: > 1. 

Necessity. Let a fc-cycle be attached to vertex Vi, and let the cycle visit the 
following vertices in the order shown: 

Vi_i-^=i, W^2J ■■■I ^MfcJ ^^A»fc+i=i- 

Then, the last k vertices in the row constitute a (A; — l)-path from into Vi. Thus, 
due to lemma ET] and theorem 12^ 



On the other hand, due to definition 12. 3[ 

{R%^, ^{v,}^ V. 

Thus, due to definition 13.11 
Sufficiency. Let 

(3%, = f]iT''-%u U iR%, + V. 
Then, there is such number v that 

Then, due to theorem 12.41 there is a (/c — l)-path from Vi into Vi,\ and, due to 
definition 12.31 there is an arc from v^, into Vi. The path and arc create a fc-cycle 
attached Xo Vi. □ 



Estimation 11.31 gives the computational complexity of theorem 13. II Particularly, 
when k = n, the theorem detects the existence/absence of Hamiltonian cycles 
in time 0{n^). But some simplifications are possible. The existence/absence of 
Hamiltonian cycles can be detected by only calculating any one string of matrix 
yn-i^ That reduces the time needed to solve the Hamiltonian cycle problem to 

0(n4). 
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Conclusion 

The paper presented set matrices as an efficient tool for solving the combinatorial 
problems. The matrices were used to solve the path/cycle problem in polynomial 
time: 

fc-path: Calculate set matrix T*-' with formulas 12. f I and If .21 Use theorem 12.41 

to detect all fc-paths in form vertex pair (start, finish); 
fc-cycle: Calculate set matrix S'' with formulas imfL^IOllXfl andlX^ Use 
theorem 13. f I to detect all vertices which have a fc-cycle attached. 
Boolean property "It is equal to the vertex set" of the elements of matrices T'' and 
S'' fulfill the path/cycle language's specification: indicate the presence/absence of 
paths/cycles. For a graph with n vertices, it will take 0(n^ )-time to write down 
the whole language in form of 0{n) matrices of size nxn filled with 1 and 0: 1 will 
mean the existence of appropriate paths/cycles and will mean their absence. 
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